Approximation Schemes for 
the Betweenness Problem in Tournaments 
and Related Ranking Problems 



Marek Karpinski* Warren Schudy^ 



Abstract 

We design the first polynomial time approximation schemes (PTASs) for tlie Mini- 
mum Betweenness problem in tournaments and some related higher arity ranking prob- 
lems. This settles the approximation status of the Betweenness problem in tournaments 
along with other ranking problems which were open for some time now. The results 
depend on a new technique of dealing with fragile ranking constraints and could be of 
independent interest. 

1 Introduction 

We study the approximability of the Minimum Betweenness problem in tournaments (see [2j) 
that resisted so far efforts of designing polynomial time approximation algorithms with a constant 
approximation ratio. For the status of the general Betweenness problem, see e.g. [9l [2l [8]. 

In this paper we design the first polynomial time approximation scheme (PTAS) for that prob- 
lem, and generalize it to much more general class of ranking CSP problems, called here fragile 
problems. To our knowledge it is the first nontrivial approximation algorithm for the Betweenness 
problem in tournaments. 

In the Betweenness problem we are given a ground set of vertices and a set of betweenness 
constraints involving 3 vertices and a designated vertex among them. The cost of a ranking of 
the elements is the number of betweenness constraints with the designated vertex not between the 
other two vertices. The goal is to find a ranking minimizing this cost. We refer to the Betweenness 
problem in tournaments, that is in instances with a constraint for every triple of vertices, as the 
BetweennessTour or fully dense Betweenness problem (see [2]). We consider also the A:-ary 
extension fc-FAST of the Feedback Arc Set in tournaments (FAST) problem (see p31 fH [3]). 

We extend the above problems by introducing a more general class of fragile ranking fc-CSP 
problems. A constraint 5 of a ranking fc-CSP problem is called fragile if no two rankings of the 
vertices S that both satisfy the constraint differ by the position of a single vertex. A ranking A;-CSP 
problem is called fragile if all its constraints are fragile. 

We now formulate our main results. 
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Theorem 1. There exists a PTAS for the BetweennessTour problem. 



The above answers an open problem of [2j on the approximation status of the Betweenness 
problem in tournaments. 

We now formulate our first generalization. 

Theorem 2. There exist PTASs for all fragile ranking k-CSP problems in tournaments. 

Theorem [2] entails, among other things, existence of a PTAS for the k-aiy extension of FAST. 
A PTAS for 2-FAST was given in [15]. 

Corollary 1. There exists a PTAS for the fc-FAST problem. 

We generalize BetweennessTour to arities A; > 4 by specifying for each constraint S a pair 
of vertices in S that must be placed at the ends of the ranking induced by the vertices in S. Such 
constraints do not satisfy our definition of fragile, but do satisfy a weaker notion that we call weak 
fragility. The definition of weakly fragile is identical to the definition for fragile except that only four 
particular single vertex moves are considered, namely swapping the first two vertices, swapping the 
last two, and moving the first or last vertex to the other end. We now formulate our most general 
theorem. 

Theorem 3. There exist PTASs for all weak- fragile ranking k-CSP problems in tournaments. 

Corollary 2. There exists a PTAS for the fc-BETWEENNESSToUR problem. 

Additionally our algorithms are guaranteed not only to find a low- cost ranking but also a 
ranking that is close to an optimal ranking in the sense of Kendall-Tau distance. Karpinski and 
Schudy [H] recently utilized this extra feature to find an improved parameterized algorithm for 
BetweennessTour with runtime 2'^'^V*^^'^/") + n'^W_ 

Theorem 4. The PTASs of Theorem\^ additionally return a set of 2^^^^'^^ rankings, one of which 
is guaranteed to be both cheap (cost at most (1 + 0{e))OPT) and close to an optimal ranking 



All our PTASs are randomized but one can easily derandomize them by exhaustively considering 
every possible random choice. 

Section [2] introduces notations and the problems we study. Section [3] introduces our algorithm 
and an intuitive sense of why it works. Section S] analyzes the runtime. The remaining sections 
analyze the cost of the output of our algorithms. 

2 Notation 

First we state some core notation. Throughout this paper let V refer to the set of n objects 
(vertices) being ranked and e > the desired approximation parameter. Our O(-) hides k but not 
e or n. Our O(-) additionally hides (log(l/e))'^'^^''. A ranking is a bijective mapping from a ground 
set 5 C y to {1, 2, 3, . . . ,15*1}. An ordering is an injection from S into M. We use tt and a (plus 
superscripts) to denote rankings and orderings respectively. Let vr* denote an optimal ordering and 
OPT its cost. We let (^) (for example) denote the standard binomial coefficient and (]^) denote 
the set of subsets of set V of size k. 



(Kendall-Tau distance O 
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Figure 1: An illustration of fragility. For a constraint to be fragile all the illustrated single vertex 
moves must make any satisfied constraint unsatisfied. 

For any ordering a let Ranking{a) denote the ranking naturally associated with a. To help 
prevent ties we relabel the vertices so that V = {1,2,3, ... , \V\}. We will often choose to place 
u in one of 0(l/e) positions P(it) = {jen + ti/(n + 1),0 < j < 1/e} (the u/{n + 1) term breaks 
ties). We say that an ordering is a bucketed ordering if a{u) G V{u) for all u. Let Round{7r) denote 
the bucketed ordering corresponding to vr (rounding down), i.e. Round{n){u) equals 7r{u) rounded 
down to the nearest multiple of en, plus u/{n + 1). 

Let v>-^p denote the ordering over {v} which maps vertex v to position p G M. For set Q of 
vertices and ordering a with domain including Q let ag denote the ordering over Q which maps 
u € Q to (7{u), i.e. the restriction of a to Q. For orderings and cr^ with disjoint domains let 

I cj^ denote the natural combined ordering over Domain{a^) U Domain{a'^). For example of our 
notations, cjq | v^p denotes the ordering over QU {v} that maps v to p and u £ Q to a{u). 

A ranking /c-CSP consists of a ground set V of vertices, an arity k > 2, and a constraint 
system c, where c is a function from rankings of k vertices to {0, 1}0 Note that c depends on the 
names of the vertices in the domain of its argument. In particular if ui, U2 and u^, are vertices then 
c(wii— 7-1 I ti2i— >-2), c(ti2i— )-l I Mil— 7-2) and c(ti2i— )-l | ^31— 7-2) are all different (although c('Uii— )-l | n2i— 7-2) and 
c{u2^2 I Mil— )•!) are the same). We say that a subset S" C ^ of size k is satisfied in ordering o" of S if 
c{Ranking{a)) = 0. For brevity we henceforth abuse notation and omit the ^^Ranking" and write 
simply c(cj). The objective of a ranking CSP is to find an ordering a (w.l.o.g. a ranking) minimizing 
the number of unsatisfied constraints, which we denote by C^((t) = J2se(°°"^'''"'-''^) '^('^s)- We will 
frequently omit the superscript c, in which case it should be understood to be the constraint system 
of the overall problem we are trying to solve. 

Abusing notation we sometimes refer to 5 C y as a constraint, when we really are referring 
to c applied to orderings of S. A constraint S is fragile if no two orderings that satisfy it differ 
by the position of a single vertex. In other words constraint S is fragile if c{its) + c(7r'5') > 1 for 
all rankings vr and tt' over S that differ by a single vertex move, i.e. vr' = Ranking{v^p \ t^s\{v}) 
for some f G 5 and p G (Z + 1/2). An alternate definition is that a satisfied fragile constraint 
becomes unsatisfied whenever a single vertex is moved, which is why it is called "fragile." Fragility 
is illustrated in Figured) 

A constraint S is weakly fragile if c{tts) + c{'n' s) > 1 for all rankings tt and vr' that differ by a 
swap of the first two vertices, a swap of the last two, or a cyclic shift of a single vertex. In other 



Our results transparently generalize to the [0, 1] case as well, but the 0/1 case allows simpler terminology. 
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Figure 2: An illustration of weak fragility. For a constraint to be weak fragile all the illustrated 
single vertex moves must make any satisfied constraint unsatisfied. 



words vr' = Ranking{v^p \ tts\{v}) for some v ^ S and p G M with {7r{v),p) G {(1, 2 + (1, k + 

{k, A; — |), (k, ^)}. Observe that this is equivalent to ordinary fragility for A; < 3. Weak fragility 
is illustrated in Figure [2j 

Our techniques handle ranking CSPs that are fully dense with weakly fragile constraints, i.e. 
every set S oi k vertices corresponds to a weakly fragile constraint. Fully dense instances are also 
known as tournaments. 

Let b'^{cr, v,p) denote the cost of the constraints involving vertex v in ordering crDom.ain{a)\{v} I 
p formed by moving v to position p in ordering a. Formally b'^{a,v,p) = ^q-... c{aQ \ vt-^p), where 
the sum is over sets Q C Domain{a) \ {v} of size k — 1. Note that this definition is valid regardless 
of whether or not v is in Domain(a). The only requirement is that the range of a excluding cr(v) 
must not contain p. This ensures that the argument to c(-) is an ordering (injective). We will 
usually omit the superscript c (as with C). 

We call a non-negative weight function {wuv}u,v€U over the edges of the complete graph induced 
by some vertex set U a feedback arc set (FAS) instance. We can express the feedback arc set 

problem in our framework by the correspondence c(ui— )-x | v^y) = { .-i ■ ■ Abusing 



^ Wuv otherwise 

notation slightly we also write C"'(cr) for C"^(cj) with the above c. More concretely C"'(cj) = 
T.u,v:a{u)>cj{v) Wuv Similarly we write b'"{a,v,p) = Y.u^v { j[ ^{j^j < ^ • If a FAS instance 

satisfies a < Wuv + Wvu < /3 for all v and some a, /? > we call it a (weighted) feedback arc set 
tournament (FAST) instance. It is easy to see that FAST captures all possible fragile constraints 
with k = 2. We generalize to /c-FAST as follows: a fc-FAST constraint over S is satisfied by one 
particular ranking of S and no others. Clearly fc-FAST constraints are fragile. 

We generalize BetweennessTour to /c > 4 as follows. Each constraint S designates two 
vertices {u,v}, which must be the first and last positions, i.e. if vr is the ranking of the vertices in 
5 then c(7r) = 1 {{Tr{u),TT{v)} / {1,^;}). It is easy to see that BetweennessTour constraints are 
weakly fragile. 

We use the following two results from the literature. 

Theorem 5 (tl5j). Let w be a FAS instance satisfying a < Wuv + w^u ^ P for a,/3 > and 
j3/a = 0(1). There is a PTAS for the problem of finding a ranking tt minimizing C'^{tt) with 
runtime n'~'^^^2^^^^^^\ 

Theorem 6 (e.g. [6l[T6]). For any k-ary MIN-CSP and 5 > Q there is an algorithm that produces 
a solution with cost at most 6n^ more than optimal. Its runtime is n^^^^2'^^^/^'^\ 

Theorem [6] entails the following corollary. 
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Corollary 7. For any 6 > and constraint system c there is an algorithm AddApprox for the 
problem of finding a ranking vr with C(-7r) < C(7r*) + 6n'' , where it* is an optimal ranking. Its 
runtime is n'^(i)2'^(^/^^) . 

3 Intuition and algorithm 

We are in need for some new techniques different in nature from the techniques used for weighted 
FAST [IS]. 

Our first idea in this direction is somehow analogous to the approximation of a differentiable 
function by a tangent line. Given a ranking vr and any ranking CSP, the change in cost from 
switching to a similar ranking vr' can be well approximated by the change in cost of a particular 
weighted feedback arc set problem (see proof of Lemma [23]) . Furthermore if the ranking CSP 
is fragile and fully dense the corresponding feedback arc set instance is a (weighted) tournament 
(Lemma [T7|) . So if we somehow had access to a ranking similar to the optimum ranking vr* we 
could create this FAST instance and run the existing PTAS for weighted FAST [T5j to get a good 
ranking. 

We do not have access to vr* but we can use a variant of the fragile techniques of [13] to get 
close. We pick a random sample of vertices and guess their location in the optimal ranking to 
within (an additive) en. We then create an ordering greedily from the random sample. We show 
that this ordering is close to vr*, in that |7r*(t;) — cr^{v)\ = 0{en) for all but 0{en) of the vertices 
(Lemma I12p . 

We then do a second greedy step (relative to a^), creating cr^. We then identify a set U of 
unambiguous vertices for which we know |vr*(f) — o"^(f)| = 0{en) (Lemma [T6|l . We temporarily 
set aside the 0{OPT / {en^-^)) (Lemma [H]) remaining vertices. These two greedy steps are sim- 
ilar in spirit to previous work on ordinary (non-ranking) everywhere-dense fragile CSPs [13j but 
substantially more involved. 

We then use cr^ to create a (weighted) FAST instance w that locally represents the CSP. It would 
not be so difficult to show that w \s a, close enough representation for an additive approximation, 
but we want a multiplicative 1 + e approximation. Showing this requires overcoming two obstacles 
that are our main technical contribution. 

Firstly the error in o"^ causes the weights of w to have significant error (Lemma I19p even in the 
extreme case of OPT = 0. At first glance even an exact solution to this FAST problem would seem 
insufficient, for how can solving a problem similar to the desired one lead to a precisely correct 
solution? Fortunately FAST is somewhat special. It is easy to see that a zero-cost instance of 
FAST cannot be modified to change its optimal ranking without modifying an arc weight by at 
least 1/2. We extend this idea to cases where OPT is small but non-zero (Lemma [23]). 

The second obstacle is that the incorrect weights in FAST instance w may increase the optimum 
cost of w far above OPT, leaving the PTAS for FAST free to return a poor ranking. To remedy this 
we create a new FAST instance w by canceling weight on opposing arcs, i.e. reducing Wuv and w^u 
by the same amount. The resulting simplified instance w clearly has the same optimum ranking 
as w but a smaller optimum value. The PTAS for FAST requires that the ratio of the maximum 
and the minimum of Wuv + Wvu must be bounded above by a constant so we limit the amount of 
cancellation to ensure this (Lemma [T7|) . It turns out that this cancellation trick is sufficient to 
ensure that the PTAS for FAST does not introduce too much error (Lemma I20p . 

Finally we greedily insert the relatively few ambiguous vertices into the ranking output by the 
PTAS for FAST [l5] (Appendix [8]). 
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Algorithm 1 A 1 + 0(e)-approximation for weak fragile rank /c-CSPs in tournaments. 

Input: Vertex set y, |y| = n, arity k, system c of fully dense arity k constraints, and approximation 

parameter e > 0. 

1: Run ADDAppROx(e^n'^) and return the result if its cost is at least e^n'^ 

2: Pick sets Ti, . . . ,Tt uniformly at random with replacement from {j^i) , where t = ^^^rly^- 

Guess (by exhaustion) bucketed ordering a^, which is the restriction of Round{-K*) to the 
sampled vertices IJj^«' where tt* is an optimal ranking. 
3: Compute bucketed ordering greedily with respect to the random samples and a^: 

a\u) = argminpgp(„) b{u,p) where b{u,p) = Ei:«0T. | v^p). 

4: For each vertex v: If b{a^,v,p) < 13k^3^~^e{2Zi) ^01 some p E 'P{v) then call v unambiguous 
and set cr^ (v) to the corresponding p (pick any if multiple p satisfy) . Let U denote the set of 
unambiguous vertices, which is the domain of bucketed ordering a"^. 

2 

5: Compute feedback arc set instance over unambiguous vertices U with weights tv'^^ (see text). 
Solve it using the FAST PTAS [TH]. Do single vertex moves until local optimality (with respect 
to the FAST objective function), yielding ranking vr^ of U. 

6: Create ordering fj^ over y defined by cT'^(ti) = I ^ n ^ 

1^ argmmp^^/(„_^i)+j-o<j<„6(7r^u,p) otherwise 

In other words insert each vertex v ^ V \ U into ^^{v) greedily. 
7: Return tt^ = Ranking{a^). 



For any ordering a with domain U we define a weighted feedback arc set instance {w'^^}uv as 
follows. Let w'^y equal the number of the constraints {u, f } ^ S C [/ with c{a') = 1 where (1) 
a' = {(7s\{v} I v^p), (2) p = cr(n) — 5 if a{v) > a{u) and p = a{v) otherwise, and (3) 5 > 
is sufficiently small to put p adjacent to a{u). In other words if v is after u in cr it is placed 
immediately before v in a'. Observe that < Wuv ^ ('fc^^)- ^'^^ orderings a and a' we 

use the abbreviation (a) = (a). The following Lemma follows easily from the definitions. 

Lemma 8. For any ordering a we have (1) Cia) = i^^C{a) and (2) b""" ia,v,a{v)) = (k - 1) ■ 
b{a, V, cr{v)) for all v. 

Proof. Observe that all Wuv that contribute to C^{a) or b"^" {a,v,a{v)) satisfy a{u) > a{v) and 
hence such equal to the number of constraints containing u and v that are unsatisfied in 

a. The (2) and k — 1 factors appear because constraints are counted multiple times. □ 

For any ordering a we define another weighted feedback arc set instance 
w'^y = w'^y — min( -^Q ('^^"2^) > '^uvt''^vu)j where U is the domain of a. For any orderings a and a' 
let C'^{a') = C^" {a'). Observe that the feedback arc set problems induced by w and w have the 
same optimal rankings but w has a smaller objective value and hence they are not equivalent for 
approximation purposes. In other words C°"(7r') — C°'{tt°) = C^{tt') — C"{ti°) for all rankings vr' 
and 7r°. 

For any orderings a and a' with domain [/, we say that {u,v} C [/ is a a / a' -inversion if 
a{u) — a{v) and (t'{u) — cr'{v) have different signs. Let d{a, a') denote the number of cr/cr'-inversions 
(a.k.a. Kendall Tau distance). We say that v does a left to right {a,p, a' ,p') -crossing if a{v) < p and 
(7'{v) > p' . We say that v does a right to left {a, p/ a' ,p') -crossing if a{v) > p and (7'{v) < p' . We 
say that v does a {(7,p, a' ,p') -crossing if v does a crossing of either sort. We say that u a/a' -crosses 
p E M if it does a ((T,p, (T',p)-crossing. 
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With these notations in hand we now formahze the ideas described in our Algorithm [TJ The non- 
deterministic "guess (by exhaustive samphng)" on hne 2 of our algorithm should be implemented in 
the traditional manner: place the remainder of the algorithm in a loop over possible orderings of the 
sample, with the overall return value equal to the best of the tt^ rankings found. Our algorithm can 
be derandomized by choosing Ti, . . . ,Tt non-deterministically rather than randomly; see Section U] 
for details. 

If OPT > e^n^ then the first line of the algorithm is sufficient for a PTAS so for the remainder 
of the analysis we assume that OPT < e^n^ . For most of the analysis we actually need something 
weaker, namely that OPT is at most some sufficiently small constant times e^n^. We only need 
the full OPT < e'^n^ in one place in Section [8l 



4 Runtime analysis 

By Theorem [7] the additive approximation step takes time n'-^^^^2'-^^^^''^°\ There are at most 
l^l/^y-ik-i) ^ 2^(Vf) bucketed orderings to try The PTAS for FAST takes time n'^W2'^W^'') 
by Theorem [5l The overall runtime is 

Derandomization increases the runtime of the two algorithms that we use as subroutines to 
j^poiy(i/e)_ There are at most n^'^^~^^ = rP^^I^) possible sets Ti,...Tt that the derandomized 
algorithm must consider. Therefore the overall runtime is 



5 Analysis of 

Let = Round{'K*). Call vertex v costly if 6((t^, v, a^{v)) > '^{^j^i^-i) ^^^'^ non-costly otherwise. 



Lemma 9. The number of costly vertices is at most 



k-OPT 



Proof. At most an e fraction of all pairs of vertices are a 7r*/o"^-inversion. Therefore by union 
bound at most an fraction of the (^Zi) possible constraints involving any particular vertex v 
contain a 7r*/cr'^-inversion. Therefore for any costly v we have 



Rearranging we get 

Finally observe that kC{TT*) = ^^h{TT* ,v,tt* {v)) > (number costly)e(2) (^l];) , completing the 
proof. □ 

Lemma 10. Let a be an ordering ofV, \V\ = n, v £ V be a vertex andp,p' £ M. Let B be the set 

of vertices (excluding v) between p and p' in a. Then b{a,v,p) + b{a,v,p') > (^^^J-l^lk-i ik-i) ■ 
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Proof. By definition 



b{a, v,p) + b{a, v,p') = ^ [c{aQ \ v^p) + c((Tq | v^p')] (1) 

Q:- 

where the sum is over sets Q '^U \ {v} of A: — 1 vertices. 

We first consider the ihustrative special case of betweenness tournament (or more generally 
fragile problems with arity k = 3). Betweenness constraints have a special property: the quantity 
in brackets in ([1]) is at least 1 for every Q that has at least one vertex between p and p' in vr. There 
are at least \B\{n — 2)/2 such sets, which can easily be lower-bounded by the desired f^^J^^-s-i (3Z1) ■ 

Returning to the general case of weak fragility, observe that the quantity in brackets in ([1]) is 
at least 1 for every Q that either has all A: — 1 vertices between p and p' in cj^ or has one vertex 
between them and the remaining k — 2 either all before or all after. To lower-bound the number of 
such Q we consider two cases. 

If \B\ > \V\/3 then the number of such Q is at least (j.^^) = i~T('fc-V) — 2-{k-i)3''-^ (fc-^) 
sufficiently large n. 

If \B\ < \V\/3 then either at least |T^|/3 vertices are before or at least |T^|/3 vertices are after 
hence the number of such Q is at least |-B|(']fy2^) > 2.3^-2 (fc'D — (k-iys'^-'^ (fc~2) ^'-'^ sufficiently 
large n. □ 

For vertex v we say that a position p £ Vi^v) is v-out of place if there are at least 6(2)3'^""^ en 
vertices between p and a^{v) in a^. We say vertex v is out of place if <t^{v) is v-out of place. 

Lemma 11. The number of non- costly out of place vertices is at most en/2 with probability at least 
9/10. 

Proof. Focus on some v € V and p G V{v). From the definition of out-of-place and Lemma [TO] we 
have 

for any f-out of place p. Next recall that for non-costly v we have 

6(a°,.,a°(.))<2(^)/^:J) (2) 



hence 



for any v-ont of place p. 
Recall that 



b{a^,v,p)>4%i'^ T) (3) 



2 \k-l 



b{v,p) = ^ V ciaWv^p) 



t ^ 

for any p. Each term of the sum is a 0/1 random variable with mean = , \ ^ X^Qgf ^ \v4q'^^'^'^ Q 
p) = i^6(cj°,z;,p). Therefore E \b{v,p) \ = b{a^,v,p). We can bound ^((t°(u)) < ^Q^ClZl) / (k-i) = 
M using ([2]). For any v-out of place p we can bound n{p) > 2M by ([3]). 
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We can bound the probabihty that sum m b{v,a'^{v)) is at least (1 + l/3)Mt using a Chernoff 
bound as 

-H«, . »p (-^ ^ : ;) « i) . 

for sufficiently large n. Similarly for any u-out of place p we can bound the probability that 
b{v,p) is at most (1 - l/3)Mt by exp{-{l/3f Mt/2) < {e/AOf. Therefore by union bound the 
probability of some v-out of place p having b{v,p) too small is at most < e/40. Clearly 

4(1 — 1/3) > 2(1 + 1/3) so each vertex v is out of place with probability at least e/20. A Markov 
bound completes the proof. □ 

Lemma 12. With probability at least 9/10 the following are simultaneously true: 

1. The number of out of place vertices is at most en. 

2. The number of vertices v with \cr^{v) — a^{v)\ > 3A;^3'^~^en is at most en 

3. d{a^,a^) < Qk'^?,^-^en'^ 

Proof. By Lemma [9] and the fact OPT < e'^n^ we have at most ^ en/2 costly vertices for 

n sufficiently large. Therefore Lemma [TT] implies the first part of the Lemma. We finish the proof 
by showing that whenever the first part holds the second and third parts hold as well. 

Observe that there are exactly en vertices in between any two consecutive positions in V{v). 
It follows that any vertex with |(T^(u) — c'^(f)| > 3A;^3'^~^en > (6(2)3'^"^ + l)en must necessarily 
be -y-out of place, completing the proof of the second part of the Lemma. 

For the final part observe that if u and v are a cr^/ir'^-inversion and not among the en out 
of place vertices then, by definition of out-of-place, there can be at most 2 • 6(2)3'^'~^en vertices 
between a^{v) and a^{u) in a^. Each u therefore only 24:(^^3^~^en possibilities for v. Therefore 
d{(J^,a^) < en^ + 2A{^/)3^-^en-n/2 < Qek'^^^'^n^. □ 

Our remaining analysis is deterministic, conditioned on the event of Lemma [T2] holding. 

6 Analysis of cr^ 

The following key Lemma shows the sensitivity of b{a,v,p) to its first and third arguments. 

Lemma 13. For any constraint system c with arity k > 2, orderings a and a' over vertex set 
T C.V , vertex v G V and p,p' gM. we have 

1. \b'^{a,v,p) — b'^{a' ,v,p)\ < ^{number of crossings) + ^d{a,a') 

2. \b''{a,v,p) -b''{a',v,p')\ < (^"2) (l^et flow\ + ky/d{a,a' 



where (Izl) = if k = 2, {net flow) is \{v eT : a'{v) > p'}\ - \{v gT : a{v) > p}\, and 
{number of crossings) is the number of v G T that do a {a,p,a' ,p')-crossing. 
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Proof. Fix a, a', T, v, p and p'. Let L (resp. R) denote the vertices in T that do left to right 
(resp. right to left) {a,p,a' ,p')-cicossmgs. It is easy to see that a constraint {v} UQ, Q G 
contributes identically to b{a,v,p) and b{a',v,p') unless either: 

1. Q and (L U R) have non-empty intersection (or) 

2. Q contains a cr/cr'-inversion 

The first part of the Lemma follows easily. 

Towards proving the second part we first bound \L\ + \R\. Observe that \L\ = \R\ + (net flow). 
Assume w.l.o.g. that (net flow) > 0. Observe that every pair v €z L and w € R are a aja'- 
mversion, hence d{a,a') > \L\ ■ \R\ = {\R\ + (net flow))\R\ > \R\^. We conclude that \L\ + \R\ = 
2\R\ + {net flow) < 2\J d{a, a') + {net flow). Therefore the number of constraints of the first type 
is at most 

(fc-2) (2\/d(o-, a') + {net flow)). 
To simplify we bound 



for sufficiently large n. □ 

Observe that the quantity net flow in Lemma [THl is zero whenever p = p' and a and a' are both 
rankings. Therefore we have the following useful corollary. 

Corollary 14. Let vr and it' be rankings over vertex set U and w a FAST instance over U . Then 
|6"'(7r, v,p) — b^{'ir' , v,p)\ < 2(maXj.^s 'Wrs)\/d{7r, tt') for all v and p G M \ Z. 

Lemma 15. For U in Algorithm^ we have \V\U\< l'k?(n-i-. = 0(f • ^). 



2ok-l 



en 



Proof. Observe that the number of vertices that cr^/cj^-cross a particular p is at most 2 • 6k^3 
by Lemma [T2] (first part). Therefore we apply Lemmas 1121 and 1131 yielding 

\b{a'',v,p) - b{a\v,p)\ < I 2) 12A:23^-ien + (^^Zf) e^t'S^-^en^ < Uek^s''-^ (^^ ~_ J) (4) 
for all V and p. 

Fix a non-costly v. By definition of costly b{a'^ ,v,a'^{v)) < 2{^)e{lzl) < k'^3''-^e(lzl) , hence 
b{a\v,a°{v)) < 13fc43'=-ie(;j:^i), sovGU. 

Finally recall Lemma [9l □ 

We define vr® to be the ranking induced by the restriction of vr* to U, i.e. vr® = Ranking{'K* u) ■ 

Lemma 16. All vertices in the unambiguous set U satisfy \a^{v) - TT®{v)\ = 0{en). 

Proof. The triangle inequality \o''^{v) — 7r®| < \a'^{v) — Tr*{v)\ + \tt*{v) — tt®\ allows us to instead 
bound the two terms \a'^ {v) — n* {v)\ and |7r*(t)) — vr® | separately by O (en). We bound {a"^ {v) — n* {v)\ 
first. 
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Since vr* is a ranking the number of vertices \B\ between 7r*{v) and cr'^iv) in vr* is at least 
|7r*(f) — cr^(f)| — 1. Therefore we have 

\Tr*{v) - a'^{v)\ -lfn-1 



, , <b{7T*,v,a (v)) + b{'K*,v,TT*{v)) (LemmadO 

(n — Ija" ^ ~ 1/ 

<2b{ir*,v,a'^{v)) (OptimaHty of vr*). (5) 

We next apply the first part of Lemma [T3l to vr* and a^, bounding the number of crossings and 
d[7r*,a^) using the definition = Round{'K*), yielding 

b{7r*,v, a\v)) < b{a^,v, a^v)) + 0{en''-^). (6) 
Next recalling Q from the proof of Lemma [15] we have 

b{a^,v, a\v)) < b{a\v, a\v)) + 0(en'=-^). (7) 
Combining ([5]), ([6]) and ([7]) we conclude that \t^*{v) — <t'^{v)\ = 0{en). 

Now we prove |vr*(f ) — 7r®| = 0{en). Lemma [151 the definition of vr®, and the assumption that 
OPT < imply that |7r®(t>) - Tr*{v)\ < .^wn^ix = 0(en). □ 

7 Analysis of tt^ 

Note that all orderings and costs in this section are over U, not V. We note that Lemma [15] and 
the assumption that OPT < e^n^ is small imply that \U\ = n — 0{e^n). 

Lemma 17. g^zTll — — '^uv + ^ '^('^^-2)' ^'^^ '^"^ ^"^ weighted FAST instance. 

Proof. We prove the more interesting lower-bound and leave the straightforward proof of the upper 
bound to the reader. Fix u,v G U. We consider two cases. 

If there are at least |f/|/3 vertices between u and v in then we note that by weak fragility 
every constraint S 5 {u, v} with all vertices in S between u and v in cr^ contributes at least 1 to 
Wuv + yJvu- Therefore Wuv + Wvu > — 2-3^-^ (fc-2) sufficiently large n and small e. 

If there are at most |C/|/3 vertices between u and v in then consider constraints with all their 
vertices either all before or all after u and v. We note that by weak fragility each such constraint 
S 5 {u, v} contributes at least 1 to Wuv + Wyu- There are clearly either at least \U\/3 vertices before 
or at least |f7|/3 vertices after, hence at least ('^l^^) > 2 3^-2 (fc-D constraints for sufficiently large 
n and small e. 

We conclude that Wuv + Wyu > (fc-2) — (fe-2) • '^^^ Lemma follows from the definition 

of id. □ 

Lemma 18. Assume ranking it and ordering a satisfy |vr('u) — cr(u)| = 0(en) for all u. For any 
u,v, let Nuv denote the number of S D {u,v} such that not all pairs {s,t} / {u,v} are in the same 
order in a and vr. We have N^^ = 0{en^~'^). 

Proof. Such a pair must satisfy \i^{s) — vr(i)| = 2 • 0{en), but few constraints contain such a 

pair. □ 

Lemma 19. The following inequalities hold: 

1. <:<<:+0(en^-2) 
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2. <:<(i+o(6))<,® 

Proof. The only constraints S D {n, v} that contribute differently to the left- and right-hand sides 
of the first part are those containing a {s,t} ^ {u,v} that are a cr^/7r®-inversion. By Lemmas [TBI 
and 1181 we can bound the number of such constraints by 0{en^), completing the proof of the first 
part. 

If t(7j[® > 2.3^k-i (' the second part follows from the first part and the trivial fact w < w. Oth- 
erwise by the first part we have w'^l < 0.6-^^ {^^^-2) ■ Therefore by Lemma[T7]u;5J^ > 0.2^^^ ('^|^^) 

hence wZl = wZl - min{0. 1 ^ {^^'^J^^) ,wZl) = min(u;J^' - O.lgFrr ('^L~2^) , 0) < mm{wZ® ,0) < wZ^ 
using the first part of the Lemma in the penultimate inequality. □ 

Lemma 20. 

1. C^'iir®) < {l + 0{e)){l)C{TT®) 

2. C'^'in^) < (l + 0(e))(^')C(7r®) 

3. C^"{7r^) - C^\7r®) = 0(eC7(7r®)) 

Proof. From the second part of Lemma [19] and Lemma [8] we conclude that 

C-'(7r®) < (l + 0(e))C'^®(7r®) = (1 + 0(e)) Q C(^®). 

proving the first part of this Lemma. 
The PTAS for FAST guarantees 

C"^'(^') < (l + 0(e))0'^'(7r®), (8) 

which combined with the first part of this Lemma yields the second part. 

Finally the first part of Lemma [19] followed by the first part of this Lemma imply 

C'"'(7r3) - 0'^'(7r®) < 0(e)C"'(7r®) < 0(60(71®)), 

completing the proof of the third part of this Lemma. □ 
Lemma 21. d{7r^,7r®) = 0{C{Tr®)/n^-^) 

Proof. TT^ and vr® both have cost at most 20(7r®) (Lemma I20[ first and second parts) for the FAST 
instance iv'^ (Lemma [T7j) . □ 

Lemma 22. We have \7r^{v) — tt®{v)\ = 0{en) for all v G U. 

2 2 

Proof. Fix V £ U. In this proof we write w (resp. ?D) as a short-hand for w'^ (resp. w'^ ). Observe 
that there are at least {\tt'^{v) — Tr®{v)\ — 1) vertices between ^^{v) and Tr®{v) + 1/2 in vr^. Any such 
vertex u must contribute Wuv to one of b"^ {tt'^ , v , n® {v) + 1/2) and b"^ (ir^ , v , tt^ (v)) and contribute 
Wvu to the other. By Lemma [17] and local optimality of vr^ we have 

(|7r3(t;)-vr®(t;)|-l)i^^^('^'_"2^) < b^{7r^v,7T®iv) + 1/2) + b^in^vyiv)) 

< 2b^{7r^,v,7r®{v) + l/2). 
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Now apply Corollary [T^ 



b'^{7r^,v,Tr®{v) + 1/2) < b'^{7r®,v, tt® + 2V(i(^®, 7r3)2 



and then recall ^^ d{iT® , n^) = 0{en) by Lemma [2T] and the assumption that OPT is small. 
Next 

^® 

b'"{TT®,v,TT®{v)) < il + 0{e))b'" (7r®,^;,7r®(?;)) (Second part of LemmadS]) 

= (l + 0(e))6(7r®,?;,7r®(t>)) (LemmaED (9) 

Finally 

bi7T®,v,TT®{v)) < b{a^,v,(7^{v)) + 0{n^-^{en + V^)) (Lemmas HSl 12] and USD 
= 0{en^-^) (veU). 

which completes the proof of the Lemma. □ 

Lemma 23. C{tt^) < (1 + 0(e))C7(^®). 

Proof. First we claim that 

|(C(7r3) - C7(7r®)) - {C^^Tr^) - C7-'(7r®))| < E^, (10) 

where Ei is the number of constraints that contain one pair of vertices n, v in different order in vr^ 
and TT® and another pair {s,t} ^ {u,v} with relative order in vr^, vr® and cr^ not all equal. Indeed 
constraints ordered identically in vr'^ and vr® contribute zero to both sides of pop . regardless of a"^. 
Consider some constraint S containing a 7r3/7r®-inversion {u, v} C S. If the restrictions of the three 
orderings to S are identical except possibly for swapping u, v then S contributes equally to both 
sides of (llOp . proving the claim. 

To bound Ei observe that the number of inversions u, v is ^(vr'^, vr®) = D. For any u, v Lemmas 
[22| 1161 and 1181 allow us to show at most 0{en^~'^) constraints containing {u,v} contribute to Ei, so 
El = 0{Den^-^) = 0{eC{-K®)) (LemmaEIJ. 

Finally bound C''\tt^) - C''\t:®) = C''\t:^) - C"\t:®) < 0(eC(^®)), where the equality 
follows from the definition of w and the inequality is the third part of Lemma [20l □ 



\U\-2 
k-2 



8 Analysis of tt^ 

In this section we prove Theorem [3) 

C(7r^) < (1 + 0(e))OPr (11) 

and 

If OPT > e^n^ then, as discussed in Section [31 Equation pi|) follows from the algorithm and 
the additive error guarantee. Equation ()12p is vacuous in this case. It remains to show (jlip and 
(fT2]) in the case that that Sections [MZl dealt with: OPT < e^n^ . 

First we prove (jlip . We consider three contributions to these costs separately: constraints with 
0, 1, or 2+ vertices m.V\U . 
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The contribution of constraints with vertices mV\U to the left- and right-hand sides of (jlip 
are clearly C{7r^) and C(7r®) respectively. We showed C{Tr^) < C{-k®) + 0(e)C(7r®) in Lemma [23 

Second we consider the contribution of constraints with exactly 1 vertex in V \ U . Consider 
some V ^ V \ U . We want to compare h{'K^ ,v,a'^{v)) and b{{'K*u),v,'K*{v)). Let p be the half- 
integer so that Ranking{v^p \ 7r®u) = Ranking{v^-K* (v) \ ir*u)- The algorithm's greedy choice 
minimizes b{7:^,v,a'^{v)) so b{7T^,v,a'^{v)) < b{TT^,v,p). Now using Lemmas [13] and [21] we have 
b{TT^,v,p) < b{^T®,v,p) + 0{^/d{^T\^n^-'^) = b{T^® ,v,p) + 0{^J OPT /n^n^-^). Note b{Tr®,v,p) = 
b{{TT*i/),v, TT*(v)). Let 7 = OPT/n'^ . We conclude by Lemma[T5]that the contribution of constraints 
with exactly 1 vertex in y \ [/ is 0{\V \ U\^JOPT /n'^n''-^) = 0{'^^^) = 0{eOPT). 

Finally by Lemma [TSl there are at most \ V\U\'^n^~'^ = 0((^)^n^n'^^^) = 0{e^OPT) constraints 
containing two or more vertices from V \ U . 

This ends the proof of pT]) . 

Finally we prove (|12|) . By Lemma [2T] we have 

(i(^3^^®) ^ 0{C{-n®)/n^-'^). 

Finally a pair of vertices can only be counted in (i(7r^,7r*) but not (i(7r^,7r®) if at least one of the 
vertices is in the ambiguous set V \ U . By Lemma [15] |y \ [/| = 0(- • ^t^) so there at most 
0{n . ^) = 0(£S) such pairs. 
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